A Power-efficient 32bit ARM ISA Processor using Timing- error Detection and Correction for Transient-error Tolerance and Adaptation to PVT Variation

نویسندگان

  • David Bull
  • Shidhartha Das
  • Karthik Shivashankar
  • Ganesh Dasika
  • Krisztian Flautner
  • David Blaauw
چکیده

Razor [1-3] is a hybrid technique for dynamic detection and correction of timing errors. A combination of error detecting circuits, and micro-architectural recovery mechanisms creates a system which is robust in the face of timing errors, and can be tuned to an efficient operating point by dynamically eliminating unused guardbands. Canary or tracking circuits [4-5] can compensate for certain manifestations of PVT variation, however they still require substantial margining to account for fast-moving or localized events, such as Ldi/dt, local IR drop, capacitive coupling, or PLL jitter. These types of events are often transient, and while the pathological case of all occurring simultaneously is extremely unlikely, it cannot be ruled out. A Razor system can survive both fast-moving and transient events, and adapt itself to the prevailing conditions, allowing excess margins to be reclaimed. The savings from margin reclamation can be realized either as a per device power efficiency (higher throughput same VDD, same throughput lower power), or as parametric yield improvement for a batch of devices. Error-detection in Razor is performed by specific circuits which explicitly check for late arriving signals. Error correction is performed by the system using either stall mechanisms with 2 corrected data substitution, or by instruction/transaction-replay. Measurements on a simplified Alpha pipeline[2] showed 33% energy savings. In [3], the authors evaluated error detection circuits on a 3-stage pipeline, using artificially induced Vcc droops showing 32% throughput (TP) gain at same Vcc, or 17% Vcc reduction at equal TP. This paper presents Razor applied to a processor which has timing paths that are representative of an industrial design, running at frequencies over 1GHz, where fast moving and transient timing-related events are significant.. The processor implements a subset of the ARM ISA, with a micro-architecture design which has balanced pipeline stages resulting in critical memory access, and clock gating enable paths. The design has been fabricated on a UMC[6] 65nm process, using industry standard EDA tools, with a worst case STA signoff of 724MHz. Silicon measurements on 63 samples including split lots show a 52% power reduction of the overall distribution for 1GHz operation. Error-rate driven dynamic voltage (DVS) and frequency scaling (DFS) schemes have been evaluated. The micro-architecture is shown in (Fig. 1). The pipeline is balanced using a combination of up-front micro-architecture design and the low level path equalization performed by backend tools, such that all stages have very similar critical-path delay. The pipeline is …

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An approach to fault detection and correction in design of systems using of Turbo ‎codes‎

We present an approach to design of fault tolerant computing systems. In this paper, a technique is employed that enable the combination of several codes, in order to obtain flexibility in the design of error correcting codes. Code combining techniques are very effective, which one of these codes are turbo codes. The Algorithm-based fault tolerance techniques that to detect errors rely on the c...

متن کامل

22.1 Razor II: In Situ Error Detection and Correction for PVT and SER Tolerance

Performance variation due to PVT uncertainty has led to an increased interest in adaptive designs. Traditional methods for adaptive design have used so-called canary circuits that mimic the critical-path delay of the actual design [1-4]. However, canary circuits require safety margins for possible mis-tracking and intradie PVT variations. They also have difficulty responding to rapidly changing...

متن کامل

Soft Error Mitigation Schemes for High Performance and Aggressive Designs

In this paper, we address the issue of soft errors in random logic and develop solutions that provide fault tolerance capabilities without logic duplication. First, we present a circuit level soft error mitigation technique which allows systems to operate without the performance overhead of soft error detection and correction circuitry. This is achieved by sampling data using our Soft Error Mit...

متن کامل

Delay-Slack Monitor for High-Performance Processors Using An All-Digital Self-Calibrating 5ps Resolution Time-to-Digital Converter

Advanced CMOS technologies have become highly susceptible to process, voltage, and temperature (PVT) variation. The standard approach for addressing this issue is to increase timing margin at the expense of power and performance. One approach to reclaim these losses relies on canary circuits [1] or sensors [2], which are simple to implement but cannot account for local variations. A more recent...

متن کامل

Reversible Logic Multipliers: Novel Low-cost Parity-Preserving Designs

Reversible logic is one of the new paradigms for power optimization that can be used instead of the current circuits. Moreover, the fault-tolerance capability in the form of error detection or error correction is a vital aspect for current processing systems. In this paper, as the multiplication is an important operation in computing systems, some novel reversible multiplier designs are propose...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009